47 research outputs found

Comparative genomics of regulation of heavy metal resistance in Eubacteria

Author: Gelfand MS
Kalinina OV
Kazakov AE
Permina EA
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Heavy metal resistance (HMR) in Eubacteria is regulated by a variety of systems, including transcription factors from the MerR family (COG0789). The HMR systems are characterized by a complex signal structure (a strong palindrome within a 19 or 20 bp promoter spacer) and usually consist of transporter and regulator genes. Some HMR regulons also include detoxification systems. The number of sequenced bacterial genomes is constantly increasing, and even though HMR regulons of the COG0789 type usually consist of few genes per genome, computational analysis may contribute to the understanding of the cellular systems of metal detoxification. RESULTS: We studied the mercury (MerR), copper (CueR and HmrR), cadmium (CadR), lead (PbrR), and zinc (ZntR) resistance systems and demonstrated that, by combining protein sequence analysis with analysis of DNA regulatory signals, it is possible to distinguish metal-dependent members of COG0789, assign specificity towards particular metals to uncharacterized loci, and find new genes involved in metal resistance, in particular multicopper oxidase and copper chaperones, candidate cytochromes from the copper regulon, new cadmium transporters and, possibly, glutathione-S-transferases. CONCLUSION: Our data indicate that the specificity of the COG0789 systems can be determined by combining phylogenetic analysis and identification of DNA regulatory sites. Taking into account signal structure, we can adequately identify genes that are activated using the DNA bending-unbending mechanism. In the case of regulon members that do not reside in single loci, analysis of potential regulatory sites could be crucial for the correct annotation and prediction of the specificity.

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Ensemble approach to predict specificity determinants: benchmarking and validation

Author: A Carro
A del Sol
AA Schäffer
Anna R Panchenko
B Reva
DP Brown
E Marchiori
HM Berman
I Kononenko
IM Wallace
J Pei
JA Capra
JE Donald
K Mizuguchi
K Ye
L Mirny
N Krishnamurthy
O Lichtarge
OV Kalinina
P Marttinen
RF Doolittle
RM Ward
S Chakrabarti
S Ohno
Saikat Chakrabarti
SS Hannenhalli
W Pirovano
WL DeLano
X Gu
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: It is extremely important and challenging to identify the sites that are responsible for functional specification or diversification in protein families. In this study, a rigorous comparative benchmarking protocol was employed to provide a reliable evaluation of methods that predict specificity determining sites. Subsequently, the three best performing methods were applied to identify new potential specificity determining sites through an ensemble approach and common agreement of their prediction results. Results: It was shown that the analysis of structural characteristics of predicted specificity determining sites might provide the means to validate their prediction accuracy. For example, we found that for smaller distances it holds true that the more reliable the prediction method is, the closer predicted specificity determining sites are to each other and to the ligand. Conclusion: We observed certain similarities of structural features between predicted and actual subsites which might point to their functional relevance. We speculate that the majority of the identified potential specificity determining sites might be indirectly involved in specific interactions and could be ideal targets for mutagenesis experiments.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Supervised multivariate analysis of sequence groups to identify specificity determining residues

Author: A Carro
A del Sol Mesa
AC Culhane
AR Fersht
CD Livingstone
CL Tucker
D Charif
Desmond G Higgins
DG Higgins
DH Morgan
E Beitz
F Pazos
G Casari
G Zhang
H Yao
HM Wilks
Iain M Wallace
J Thioulouse
JC Gower
JD Thompson
JG Henikoff
KM Mayer
L Yuan
LA Mirny
M Clamp
N Saitou
O Lichtarge
OV Kalinina
RC Gentleman
RD Finn
RJ Edwards
S Dolédec
S Henikoff
SJ Hubbard
SS Hannenhalli
TD Schneider
V Vacic
W Pirovano
WR Atchley
X Gu
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able to identify the residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specificities using multiple sequence alignments. Results: We demonstrate the usefulness of this method on three different test cases. Two of these test cases, the Lactate/Malate dehydrogenase family and Nucleotidyl Cyclases, consist of two functional groups. The other family, Serine Proteases, consists of three groups. BGA was used to analyse and visualise these three families using two different encoding schemes for the amino acids. Conclusion: The overall combination of methods in this paper is powerful and flexible while being computationally very fast and simple. BGA is especially useful because it can be used to analyse any number of functional classes. In the examples in this paper we used only 2 or 3 classes for demonstration purposes, but any number can be used and visualised.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Combining specificity determining and conserved residues improves functional site prediction

Author: A Carro
A del Sol Mesa
A Shulman-Peleg
A Stark
A Teplyakov
AE Todd
ATR Laurie
B Ma
B Mirkin
B Reva
B Zambelli
BJ Polacco
C Romier
C Yeats
CT Porter
DA Rodionov
EA Gaucher
G Dodson
G Koczyk
G Wu
GJ Kleywegt
H Yao
IM Wallace
IN Shindyalov
J Capra
J Dundas
J Pei
J-M Chandonia
JA Capra
JE Donald
JR Manning
K Ye
KA Feenstra
KM Mayer
L Aravind
L Holm
LA Mirny
M Hendlich
M Landau
MA Willis
Mikhail S Gelfand
O Lichtarge
Olga V Kalinina
OV Kalinina
P Aloy
PP Khil
R Landgraf
RD Finn
RJ Edwards
Robert B Russell
S Ahmad
S Chakrabarti
S Sankararaman
S Whelan
SS Hannenhalli
T Maier
T Pupko
WR Taylor
WSJ Valdar
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Predicting the location of functionally important sites from protein sequence and/or structure is a long-standing problem in computational biology. Most current approaches make use of sequence conservation, assuming that amino acid residues conserved within a protein family are most likely to be functionally important. Most often these approaches do not consider many residues that act to define specific sub-functions within a family, or they make no distinction between residues important for function and those more relevant for maintaining structure (e.g. in the hydrophobic core). Many protein families bind and/or act on a variety of ligands, meaning that conserved residues often only bind a common ligand sub-structure or perform general catalytic activities. Results: Here we present a novel method for functional site prediction based on identification of conserved positions, as well as those responsible for determining ligand specificity. We define Specificity-Determining Positions (SDPs) as those occupied by residues that are conserved within sub-groups of proteins in a family having a common specificity but that differ between groups, and are thus likely to account for specific recognition events. We benchmark the approach on enzyme families of known 3D structure with bound substrates, and find that in nearly all families residues predicted by SDPsite are in contact with the bound substrate, and that the addition of SDPs significantly improves functional site prediction accuracy. We apply SDPsite to various families of proteins containing known three-dimensional structures, but lacking clear functional annotations, and discuss several illustrative examples. Conclusion: The results suggest a better means to predict functional details for the thousands of protein structures determined prior to a clear understanding of molecular function.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central
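
The definition of Specificity-Determining Positions in the abstract above (conserved within each specificity group, different between groups) can be illustrated with a deliberately strict toy filter. This is only a sketch under stated assumptions, not the SDPsite method: the function name `sdp_like_positions`, the toy alignment, and the all-or-nothing conservation rule are hypothetical, and the abstract does not describe the scoring SDPsite actually uses.

```python
def sdp_like_positions(groups):
    """Return alignment columns that are invariant within every group
    but not identical across all groups.

    `groups` maps a specificity label to a list of equal-length,
    pre-aligned sequences (an assumption of this sketch).
    """
    length = len(next(iter(groups.values()))[0])
    hits = []
    for col in range(length):
        # Set of residues observed at this column within each group.
        per_group = [{seq[col] for seq in seqs} for seqs in groups.values()]
        conserved_within = all(len(residues) == 1 for residues in per_group)
        # One representative residue per group; more than one distinct
        # representative means the groups differ at this column.
        representatives = {next(iter(residues)) for residues in per_group}
        if conserved_within and len(representatives) > 1:
            hits.append(col)
    return hits


# Hypothetical toy alignment with two specificity groups.
toy_alignment = {
    "substrate_A": ["MKLDG", "MKLDG", "MRLDG"],
    "substrate_B": ["MKLEG", "MRLEG", "MKLEG"],
}
print(sdp_like_positions(toy_alignment))
```

In the toy data only column 3 (0-based) is D in every substrate_A sequence and E in every substrate_B sequence, so it is the only column reported; column 1 varies inside both groups and is rejected.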

H2r: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments

Author: A del Sol Mesa
AL Barabási
B Rost
C Notredame
C Ouzounis
C Sander
C Steegborn
CC Hyde
CE Shannon
D Altschuh
DR Caffrey
E Eyal
E Neher
E Weber-Ban
E Zuckerkandl
ER Tillier
F Pearl
GB Gloor
GM Süel
HO Villar
I Kass
IM Wallace
J Tsai
JA Capra
JP Dekker
K Katoh
K Wang
LA Kelley
LC Martin
M Landau
Matthias Zwick
MC Saraf
ME Noble
O Noivirt
O Olmea
OV Kalinina
R Merkl
RA Estabrook
RA Laskowski
Rainer Merkl
RD Finn
RI Dima
S Henikoff
SJ Fleishman
SM Larson
SW Lockless
T Lassmann
T Sato
TD Schneider
U Göbel
V Kulik
WH Press
WR Atchley
Publication venue: BioMed Central
Publication date: 01/01/2007

BACKGROUND: A multiple sequence alignment (MSA) generated for a protein can be used to characterise residues by means of a statistical analysis of single columns. In addition to the examination of individual positions, the investigation of co-variation of amino acid frequencies offers insights into function and evolution of the protein and residues. RESULTS: We introduce conn(k), a novel parameter for the characterisation of individual residues. For each residue k, conn(k) is the number of most extreme signals of co-evolution in which k takes part. These signals were deduced from a normalised mutual information (MI) value U(k, l) computed for all pairs of residues k, l. We demonstrate that conn(k) is a more robust indicator than an individual MI value for the prediction of residues most plausibly important for the evolution of a protein. This proposition was inferred by means of statistical methods. It was further confirmed by the analysis of several proteins. A server that computes conn(k) values is available at http://www-bioinf.uni-regensburg.de. CONCLUSION: The algorithm H2r, which analyses MSAs and computes conn(k) values, characterises a specific class of residues. In contrast to strictly conserved ones, these residues possess some flexibility in the composition of side chains. However, their allocation is sensibly balanced with several other positions, as indicated by conn(k).

University of Regensburg Publication Server

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central
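
The conn(k) idea from the H2r abstract above (count how often a column appears among the strongest normalised-MI pairs) can be sketched roughly as follows. Several details are assumptions made only for illustration: the abstract does not say how U(k, l) is normalised, so division by the joint entropy is used here, and the cutoff `top_pairs` and the toy alignment are hypothetical. This is not the H2r implementation.

```python
import math
from collections import Counter
from itertools import combinations


def entropy(symbols):
    """Shannon entropy (in bits) of a sequence of hashable symbols."""
    counts = Counter(symbols)
    total = len(symbols)
    return -sum(c / total * math.log2(c / total) for c in counts.values())


def normalised_mi(col_k, col_l):
    """MI(k, l) divided by the joint entropy H(k, l).

    The normaliser is an assumption of this sketch; returns 0.0 when
    the joint entropy is zero.
    """
    h_kl = entropy(list(zip(col_k, col_l)))
    if h_kl <= 0:
        return 0.0
    return (entropy(col_k) + entropy(col_l) - h_kl) / h_kl


def conn(alignment, top_pairs):
    """For each column, count its occurrences among the `top_pairs`
    highest-scoring column pairs (ties are broken by column index)."""
    columns = list(zip(*alignment))
    scored = sorted(
        ((normalised_mi(columns[k], columns[l]), k, l)
         for k, l in combinations(range(len(columns)), 2)),
        reverse=True,
    )
    counts = Counter()
    for _, k, l in scored[:top_pairs]:
        counts[k] += 1
        counts[l] += 1
    return [counts[k] for k in range(len(columns))]


# Hypothetical toy MSA: columns 0-2 co-vary perfectly, column 3 is invariant.
toy_msa = ["AKDG", "AKDG", "GREG", "GREG"]
print(conn(toy_msa, top_pairs=3))
```

With this toy MSA the three co-varying columns each take part in two of the three top pairs, while the invariant column takes part in none.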

Structure-guided selection of specificity determining positions in the human kinome

Author: A Rausell
BY Chen
C Schalon
D Huang
D Kuhn
DH Bryant
F Milletti
F Pazos
GE Crooks
I Halperin
I Steinwart
IT Jolliffe
J Blanc
J Mok
JA Bikker
JA Capra
L Xie
L Xing
Lydia E. Kavraki
M Menke
M Moll
Mark Moll
MC Heinrich
MW Karaman
O Weigert
OC Redfern
OV Kalinina
Paul W. Finn
PW Finn
RC de Melo-Minardi
RD Finn
RK Kancha
S Chakrabarti
S Redaelli
SB Hari
SL Kinnings
T Liu
T Trowe
Y Liu
Publication venue: Springer Science and Business Media LLC
Publication date: 18/08/2016

Background: The human kinome contains many important drug targets. It is well-known that inhibitors of protein kinases bind with very different selectivity profiles. This is also the case for inhibitors of many other protein families. The increased availability of protein 3D structures has provided much information on the structural variation within a given protein family. However, the relationship between structural variations and binding specificity is complex and incompletely understood. We have developed a structural bioinformatics approach which provides an analysis of key determinants of binding selectivity as a tool to enhance the rational design of drugs with a specific selectivity profile. Results: We propose a greedy algorithm that computes a subset of residue positions in a multiple sequence alignment such that structural and chemical variation in those positions helps explain known binding affinities. The main purpose of the algorithm is to give experimentalists possible insights into how the selectivity profile of certain inhibitors is achieved, which is useful for lead optimization. In addition, the algorithm can also be used to predict binding affinities for structures whose affinity for a given inhibitor is unknown. The algorithm’s performance is demonstrated using an extensive dataset for the human kinome. Conclusion: We show that the binding affinity of 38 different kinase inhibitors can be explained with consistently high precision and accuracy using the variation of at most six residue positions in the kinome binding site. We show for several inhibitors that we are able to identify residues that are known to be functionally important.

BEAR (Buckingham E-Archive of Research)

Crossref

PubMed Central

DSpace at Rice University

Glutamine versus Ammonia Utilization in the NAD Synthetase Family

NAD is a ubiquitous and essential metabolic redox cofactor which also functions as a substrate in certain regulatory pathways. The last step of NAD synthesis is the ATP-dependent amidation of deamido-NAD by NAD synthetase (NADS). Members of the NADS family are present in nearly all species across the three kingdoms of Life. In eukaryotic NADS, the core synthetase domain is fused with a nitrilase-like glutaminase domain supplying ammonia for the reaction. This two-domain NADS arrangement, which enables the utilization of glutamine as nitrogen donor, is also present in various bacterial lineages. However, many other bacterial members of the NADS family do not contain a glutaminase domain, and they can utilize only ammonia (but not glutamine) in vitro. A single-domain NADS is also characteristic of nearly all Archaea, and its dependence on ammonia was demonstrated here for the representative enzyme from Methanocaldococcus jannaschii. However, a question about the actual in vivo nitrogen donor for single-domain members of the NADS family remained open: Is it glutamine hydrolyzed by a committed (but yet unknown) glutaminase subunit, as in most ATP-dependent amidotransferases, or free ammonia as in glutamine synthetase? Here we addressed this dilemma by combining evolutionary analysis of the NADS family with experimental characterization of two representative bacterial systems: a two-subunit NADS from Thermus thermophilus and a single-domain NADS from Salmonella typhimurium, providing evidence that ammonia (and not glutamine) is the physiological substrate of a typical single-domain NADS. The latter represents the most likely ancestral form of NADS. The ability to utilize glutamine appears to have evolved via recruitment of a glutaminase subunit followed by domain fusion in an early branch of Bacteria. Further evolution of the NADS family included lineage-specific loss of one of the two alternative forms and horizontal gene transfer events. Lastly, we identified NADS structural elements associated with glutamine-utilizing capabilities.

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

The FGGY carbohydrate kinase family: insights into the evolution of functional specificities

Author: A Osterman
A Vendeville
Adam Godzik
AE Todd
AM Schnoes
Andrei Osterman
B Reva
BE Engelhardt
BG Magor
CA Bonner
CA Orengo
Christos A. Ouzounis
CM Seibert
D Grueninger
D Wu
DA Lee
DA Rodionov
E Di Luccio
G Casari
GE Crooks
HM Berman
I Letunic
Irina Rodionova
JA Capra
JA Gerlt
JH Hurley
JI Yeh
K Sjolander
K Ye
KB Xavier
LA David
M Ormo
M Pachkov
ME Glasner
MN Price
MV Omelchenko
N Krishnamurthy
Olga Zagnitko
OV Kalinina
P Shannon
R Overbeek
RC Edgar
RD Finn
RK Aziz
S Cheek
SS Hannenhalli
TA Tatusova
TT Nguyen
W-D Fessner
Y Zhang
Ying Zhang
Publication venue: Public Library of Science (PLoS)
Publication date: 01/12/2011

© The Author(s), 2011. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in PLoS Computational Biology 7 (2011): e1002318, doi:10.1371/journal.pcbi.1002318. Function diversification in large protein families is a major mechanism driving expansion of cellular networks, providing organisms with new metabolic capabilities and thus adding to their evolutionary success. However, our understanding of the evolutionary mechanisms of functional diversity in such families is very limited, which, among many other reasons, is due to the lack of functionally well-characterized sets of proteins. Here, using the FGGY carbohydrate kinase family as an example, we built a confidently annotated reference set (CARS) of proteins by propagating experimentally verified functional assignments to a limited number of homologous proteins that are supported by their genomic and functional contexts. Then, we analyzed, on both the phylogenetic and the molecular levels, the evolution of different functional specificities in this family. The results show that the different functions (substrate specificities) encoded by FGGY kinases have emerged only once in the evolutionary history following an apparently simple divergent evolutionary model. At the same time, on the molecular level, one isofunctional group (L-ribulokinase, AraB) evolved at least two independent solutions that employed distinct specificity-determining residues for the recognition of the same substrate (L-ribulose). Our analysis provides a detailed model of the evolution of the FGGY kinase family. It also shows that only combined molecular and phylogenetic approaches can help reconstruct a full picture of functional diversifications in such diverse families. This study was funded by NIH and DOE grants.

Public Library of Science (PLOS)

Crossref

Woods Hole Open Access Server

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Using Shifts in Amino Acid Frequency and Substitution Rate to Identify Latent Structural Characters in Base-Excision Repair Enzymes

Author: A Bateman
A del Sol
A Gutteridge
A Marchler-Bauer
A Stamatakis
AB Robertson
AH Elcock
AN Barclay
AR Panchenko
AT Laurie
B Kolaczkowski
B Reva
C Branden
C Notredame
DA Kraut
DE Pumo
DM Standley
DO Zharkov
DP Brown
DR Caffrey
DT Jones
E Deu
E Hodis
E Martz
E Youn
EA Gaucher
EC Friedberg
F Coste
G Casari
G Golan
G Nimrod
GJ Naylor
H Hirano
I Mihalek
IN Sarkar
J Felsenstein
J Ko
J Pei
JA Capra
JC Fromme
Jeffrey P. Bond
JM Koshi
K Imamura
K Katoh
K Pereira de Jesus
KD Pruitt
KP Peters
KY Kropachev
L Rabow
LA Mirny
LE Limbird
M Clamp
M Guharoy
M Landau
M Rogacheva
M Saparbaev
M Sugahara
MJ Ondrechen
N Galtier
N Miyatake
NV Petrova
O Lichtarge
O Rahat
O Schueler-Furman
OM Sidorkina
OV Kalinina
P Aloy
P Amara
P Lio
P Lopez
P Marttinen
Q Cheng
R Gilboa
R Landgraf
RA George
Ramiro Barrantes-Reynolds
S Ahmad
S Burgess
S Doublie
S Gribaldo
S Henikoff
S Madabushi
S Sankararaman
S Wolfram
SD Kathe
Sebastian D. Fugmann
SF Altschul
SS Hannenhalli
SS Wallace
Susan S. Wallace
SV Kuznetsov
V Bandaru
V Ruano-Rubio
WL Delano
X Gu
Z Yang
Publication venue: Public Library of Science
Publication date: 06/10/2011

Protein evolution includes the birth and death of structural motifs. For example, a zinc finger or a salt bridge may be present in some, but not all, members of a protein family. We propose that such transitions are manifest in sequence phylogenies as concerted shifts in substitution rates of amino acids that are neighbors in a representative structure. First, we identified rate shifts in a quartet from the Fpg/Nei family of base excision repair enzymes using a method developed by Xun Gu and coworkers. We found the shifts to be spatially correlated and, more precisely, associated with a flexible loop involved in bacterial Fpg substrate specificity. Consistent with our result, sequences and structures provide convincing evidence that this loop plays a very different role in other family members. Second, we developed a method for identifying latent protein structural characters (LSC) from a set of homologous sequences, based on Gu's method and proximity in a high-resolution structure. Third, we identified LSC and assigned states of LSC to clades within the Fpg/Nei family of base excision repair enzymes. We describe seven LSC; an accompanying Proteopedia page (http://proteopedia.org/wiki/index.php/Fpg_Nei_Protein_Family) describes these in greater detail and facilitates 3D viewing. The LSC we found provided a surprisingly complete picture of the interaction of the protein with the DNA, capturing familiar examples, such as a Zn finger, as well as more subtle interactions. Their preponderance is consistent with an important role as phylogenetic characters. Phylogenetic inference based on LSC provided convincing evidence of independent losses of Zn fingers. Structural motifs may serve as important phylogenetic characters, and modeling transitions involving structural motifs may provide a much deeper understanding of protein evolution.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Computing Highly Correlated Positions Using Mutual Information and Graph Theory for G Protein-Coupled Receptors

Author: A Pagano
AA Ivanov
AK Ramani
AR Ortiz
B Galitsky
BTM Korber
C Goh
C Hemmerich
C Yeang
Carson C. Chow
CD Strader
CE Shannon
CJ Harris
CS Sum
D Altschuh
DD Pollock
DKY Chiu
DM Rosenbaum
E Neher
F Horn
F Knoflach
F Pazos
FY Carroll
G Casari
G Kleinau
G Suel
G Swaminath
GB Gloor
H Herzel
H Jaschke
I Halperin
I Kass
IG Tikhonova
IN Shindyalov
J Dutheil
J Kim
J Thomas
JA Ballesteros
JA Capra
JE Donald
JS Surgand
JW Kelly
JX Hu
K Palczewski
K Ray
K Sjolander
K Ye
KD Pruitt
KL Pierce
KY Yip
L Lewyn
L Oliveira
L Pritchard
LA Mirny
LC Martin
LH Heitman
M Raviscioni
M Scarselli
M Socolich
MA Hanson
Matthieu Louis
ME Olah
MJ Buck
ML Lopez-Rodriguez
MS Roulston
MW Dimmic
ND Clarke
NG Hoffman
O Lichtarge
O Noivirt
OF Lange
OV Kalinina
PJ Kundrotas
PR Gouldson
R Banerjee
R Brun
R Fredriksson
R Jothi
R Steuer
RI Dima
RM Williamson
RR Gutell
S Chakrabarti
S Costanzi
S Govindarajan
S Litschig
S Madabushi
S Moore
S Moro
S Ohno
S Takeda
Sarosh N. Fatakia
SB Nagl
SD Dunn
SGF Rasmussen
SJ Fleishman
SS Hannenhalli
Stefano Costanzi
SW Lockless
T Klabunde
T Sato
T Warne
TD Schneider
TM Cover
V Batageli
V Cherezov
VP Jaakola
WP Russ
WR Atchley
WR Taylor
Y Liu
Y Qi
Publication venue: Public Library of Science
Publication date: 05/03/2009

G protein-coupled receptors (GPCRs) are a superfamily of seven transmembrane-spanning proteins involved in a wide array of physiological functions and are the most common targets of pharmaceuticals. This study aims to identify a cohort or clique of positions that share high mutual information. Using a multiple sequence alignment of the transmembrane (TM) domains, we calculated the mutual information between all inter-TM pairs of aligned positions and ranked the pairs by mutual information. A mutual information graph was constructed with vertices that corresponded to TM positions, and an edge was drawn between two vertices if their mutual information exceeded a threshold of statistical significance. Positions with high degree (i.e. those with significant mutual information with a large number of other positions) were found to line a well defined inter-TM ligand binding cavity for class A as well as class C GPCRs. Although the natural ligands of class C receptors bind to their extracellular N-terminal domains, the possibility of modulating their activity through ligands that bind to their helical bundle has been reported. Such positions were not found for class B GPCRs, in agreement with the observation that there are no known ligands that bind within their TM helical bundle. All identified key positions formed a clique within the MI graph of interest. For a subset of class A receptors we also considered the alignment of a portion of the second extracellular loop, and found that the two positions adjacent to the conserved Cys that bridges the loop with TM3 qualified as key positions. Our algorithm may be useful for localizing topologically conserved regions in other protein families.

Public Library of Science (PLOS)

Crossref

PubMed Central
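
The graph construction described in the GPCR abstract above (vertices are positions, edges join pairs whose mutual information exceeds a threshold, key positions have high degree and form a clique) could be sketched as below. The MI values, the fixed `threshold`, and the `min_degree` cutoff are hypothetical placeholders: the paper derives its threshold from statistical significance, which this sketch does not attempt to reproduce.

```python
from itertools import combinations


def mi_graph(mi, threshold):
    """Adjacency sets: positions i and j are linked when mi[i][j] > threshold.

    `mi` is assumed to be a symmetric square matrix (list of lists).
    """
    n = len(mi)
    adjacency = {i: set() for i in range(n)}
    for i, j in combinations(range(n), 2):
        if mi[i][j] > threshold:
            adjacency[i].add(j)
            adjacency[j].add(i)
    return adjacency


def high_degree_positions(adjacency, min_degree):
    """Positions linked to at least `min_degree` other positions."""
    return sorted(v for v, nbrs in adjacency.items() if len(nbrs) >= min_degree)


def is_clique(adjacency, vertices):
    """True when every pair of the given vertices is directly linked."""
    return all(b in adjacency[a] for a, b in combinations(vertices, 2))


# Hypothetical MI values for four aligned positions.
toy_mi = [
    [0.0, 0.9, 0.8, 0.1],
    [0.9, 0.0, 0.7, 0.6],
    [0.8, 0.7, 0.0, 0.2],
    [0.1, 0.6, 0.2, 0.0],
]
graph = mi_graph(toy_mi, threshold=0.5)
key_positions = high_degree_positions(graph, min_degree=2)
print(key_positions, is_clique(graph, key_positions))
```

In the toy matrix, positions 0, 1 and 2 each have at least two significant partners and are all linked to one another, so they pass both the degree filter and the clique check; position 3 has a single edge and is left out.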